Tandem processing of fepstrum features

نویسنده

  • Vivek Tyagi
چکیده

In our previous work [1, 2], we have introduced Fepstrum an improved modulation spectrum estimation technique that overcomes certain theoretical as well as practical shortcomings in the previously published modulation spectrum related techniques[7, 8, 9]. In this paper, we provide further extensive ASR results using the Tandem processed Fepstrum features over the TIMIT corpus. The results are compared with TRAPS features derived from hierarchical and parallel structures of neural networks[3]. Unlike the multiple neural networks trained over multiple time-frequency patches or the frequency bands as in [3], we train a single neural network with the concatenated Fepstrum and MFCC features to derive Tandem(Fepstrum+MFCC) features. The resultant phoneme recognition accuracy of the concatenated Tandem(Fepstrum+MFCC)+MFCC feature is 76.5% on the TIMIT core test set and 77.6% on the complete test set making these one of the best reported results on the TIMIT continuous phoneme recognition task.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fepstrum Features: Design and Application to Conversational Speech Recognition

In this paper, we present the Fepstrum features – a principled approach to estimate the modulation spectrum of the speech signals using the Hilbert envelopes in a nonparametric way. The importance of the modulation spectrum as a feature in the automatic speech recognition (ASR) has long been established by several researchers in the past twothree decades. However, traditionally, in the speech r...

متن کامل

Fepstrum: an improved modulation spectrum for ASR

In our previous work[3, 4], we have introduced fepstrum; an improved modulation spectrum estimation technique that overcomes certain theoretical as well as practical shortcomings in the previously published modulation spectrum related techniques[11, 13, 14]. In[3], we have also shown that fepstrum is an exact dual of the well known quantity, real cepstrum. In this paper, we provide further exte...

متن کامل

Comparison of AM-FM based features for robust speech recognition

Effective feature extraction for robust speech recognition is a widely addressed topic and currently there is much effort to invoke non-stationary signal models instead of quasi-stationary signal models leading to standard features such as LPC or MFCC. Joint amplitude modulation and frequency modulation (AM-FM) is a classical non-parametric approach to nonstationary signal modeling and recently...

متن کامل

Data Mining for Identification of Forkhead Box O (FOXO3a) in Different Organisms Using Nucleotide and Tandem Repeat Sequences

 Background: Deregulation of FOXO3a gene which belongs to Forkhead box O (FOXO) transcription factors, can cause cancer (e.g. breast cancer). FOXO factors have important role in ubiquitination, acetylation, de-acetylation, protein-protein interactions and phosphorylation. Understanding the regulation and mechanisms of FOXO3a can lead to cancer treatment. The aim of this study recent association...

متن کامل

Tandem Synthesis and Optical Rotatory Dispersion Studies of a Novel Spiro Lactone (methyl 3-(benzo[d]thiazol-2-ylamino)-7,9-dimethyl-2,6,8,10-tetraoxo-1-oxa-7,9-diazaspiro[4.5]dec-3-ene-4-carboxylate)

Spiro compounds are of interest due to their interesting conformational features and their structural implications on biological systems. The asymmetric characteristic of the molecule due to the chiral spiro carbon is one of the important criteria of the biological activities. These structures are a widespread structural motif found as key elements of numerous drugs and designed medicinal agent...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008